TISS: An Integrated Summarization System for TSC-3
نویسندگان
چکیده
In consideration of the previous workshop, we participate in TSC-3 to make improvements on important sentence extraction used in dry run of TSC-2. We formulate important sentence extraction as a combinational optimization problem that determines a set of sentences containing as many important information fragments as possible. In addition to the extraction method, we reinforce peripheral components such as sentence ordering, anaphora analysis and sentence compression to improve summary readability. We propose a remedy of chronological ordering by complementing presupposed information of each sentence. This paper reports mainly on important sentence extraction and sentence ordering.
منابع مشابه
Comparison of Feature Usage at TSC-3 Summarization Tasks
We participated in two summarization tasks at the TSC-3. We have introduced categorization of feature values for our summarization system, which is based on sentence extraction technique. The categorized values were used as features for generating a decision tree. We compared our summarization system using the categorization of feature values with the one using linear combination of features in...
متن کاملKernel-based Approach for Automatic Evaluation of Natural Language Generation Technologies: Application to Automatic Summarization
In order to promote the study of automatic summarization and translation, we need an accurate automatic evaluation method that is close to human evaluation. In this paper, we present an evaluation method that is based on convolution kernels that measure the similarities between texts considering their substructures. We conducted an experiment using automatic summarization evaluation data develo...
متن کاملTwo Different Summarization Methods at NTCIR3-TSC2: Coverage Oriented and Focus Oriented
We are conducting research on multi-document summarization, participating in a competition of sum-marization, TSC (Text Summarization Challenge) task organized by NTCIR-3 project. In a dry run, we conceived a new extraction method for multi-document summarization which extracts a set of sentences that maximizes coverage of an original text and minimizes redundancy of a summary. Thinking over th...
متن کاملNTT/NAIST's Text Summarization Systems for TSC-2
In this paper, we describe the following two approaches to summarization: (1) only sentence extraction, (2) sentence extraction + bunsetsu elimination. For both approaches, we use the machine learning algorithm called Support Vector Machines. We participated in both Task-A (single-document summarization task) and Task-B (multi-document summarization task) of TSC-2.
متن کاملYet Another Summarization System with Two Modules using Empirical Knowledge
We previously proposed a summarization system, GREEN, for Japanese newspaper editorials. However, GREEN is not suitable for summarizing ordinal newspaper articles which are different from newspaper editorials. To participate in subtasks A-1 and A-2 of TSC (text Summarization Challenge) in NTCIR-2, we developed a new summarization system from scratch which copes with both ordinal articles and ed...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004